Vocabulary Extension Recognition System for a based Speaker - Adaptive on CVC Units

نویسندگان

  • Peter Fesseler
  • Heidi Hackbarth
  • Marianne Kugler
چکیده

For speech recognition with large vocabularies, a user should not be burdened with having to train several thousand words explicitly. Therefore, it proves extremely useful to provide a means for easy vocabulary generation and enlargement from written text input. Applying a set of appropriately defined rules, the orthography of a lexicon item is first transcribed into the phonetic symbols of the standard pronunciation and the most common alternatives thereof. From these, the multiple sequence of specific subword units of a lexicon entry is produced. The tool introduced here is part of a comprehensive speech processing system for subword-unit based, speaker-adaptive recognition of continuous speech.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum Likelihood Lineartransformations for Hmm

This paper examines the application of linear transformations for speaker and environmental adaptation in an HMM-based speech recognition system. In particular, transformations that are trained in a maximum likelihood sense on adaptation data are investigated. Other than in the form of a simple bias, strict linear feature-space transformations are inappropriate in this case. Hence, only model-b...

متن کامل

Closed-Set Speaker Identification Based on a Single Word Utterance: An Evaluation of Alternative Approaches

The problem of closed-set speaker identification based on a single spoken word from a limited vocabulary is relevant to several current and futuristic interactive multimedia applications. In this paper, we evaluate the effectiveness of several potential solutions using an isolated word speech corpus. In addition to evaluating the text-dependent and text-constrained variants of the Gaussian Mixt...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Combined Optimisation of Baseforms and Model Parameters in Speech Recognition Based on Acoustic Subword Units

A major challenge in speech recognition is creating a lexicon which is robust to inter-and intra-speaker variations. This is even more so in speech recognisers based on non-linguistic units, e.g., acoustic subword units (ASWUs), since no standard pronunciation dictionaries are available. Thus the baseforms describing the vocabulary words in terms of the recognition units need to be generated fr...

متن کامل

A unified language model for large vocabulary continuous speech recognition of Turkish

We have designed a Turkish dictation system for newspaper content transcription application. Turkish is an agglutinative language with free word order. These characteristics of the language result in vocabulary explosion, large number of out-of-vocabulary (OOV) words and an increased complexity of n-gram language models in speech recognition when words are used as recognition units. In this pap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006